An adaptive wavelet-based approach for perceptual low bit rate audio coding attending to entropy-type criteria

Authors

  • N. RUIZ REYES
  • M. ROSA ZURERA
  • F. LOPEZ FERRERAS
  • D. MARTINEZ MUÑOZ
Abstract

This paper outlines an adaptive wavelet-based perceptual audio coding scheme driven by entropy-type criteria. Its performance is investigated for several wavelet families, filter lengths, and decomposition depths. These parameters are optimized in order to evaluate both the quality and the bit rate of the compressed signals for four entropy-type criteria and four representative samples of audio material. The proposed coding scheme performs a periodized wavelet packet transform on each audio frame, yielding a decomposition tree that is adapted to the characteristics of the frame according to an entropy criterion. After the time-frequency mapping, a thresholding-to-zero step is carried out so that the subsequent entropy coding is more effective. Next, a uniform quantizer is applied, controlled by a psychoacoustic model that exploits the masking effect in human hearing. Finally, statistical redundancies in the audio signal are reduced using Huffman and run-length coding. Experimental results indicate that the proposed approach can achieve almost transparent coding of monophonic CD-quality audio signals at bit rates of approximately 64 kb/s (1.45 bit/sample). In addition, the use of the periodized wavelet transform leads to a lower coding delay than other similar methods in the literature. The performance of the method is compared with some non-adaptive wavelet-based methods and with the MPEG standard in terms of compression versus quality.

IMACS/IEEE CSCC'99 Proceedings, Pages: 3441-3445

Key-Words: wavelet-based audio coding, psychoacoustic model, entropy-type criteria, MPEG
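The entropy-driven tree adaptation and the thresholding step described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the Haar filter, the Shannon-type additive cost, and all function names (`haar_split`, `shannon_cost`, `best_basis`, `threshold_to_zero`, `bits_per_sample`) are assumptions chosen for brevity, whereas the paper studies several wavelet families and four entropy-type criteria that the abstract does not name. The last helper only checks the quoted bit rate, assuming the CD sampling rate of 44.1 kHz.

```python
import math


def haar_split(x):
    """One Haar analysis step: return the (low-pass, high-pass) halves of x."""
    s = 1.0 / math.sqrt(2.0)
    low = [(x[i] + x[i + 1]) * s for i in range(0, len(x), 2)]
    high = [(x[i] - x[i + 1]) * s for i in range(0, len(x), 2)]
    return low, high


def shannon_cost(coeffs):
    """Additive Shannon-type cost: -sum(c^2 * log(c^2)) over nonzero c."""
    return -sum(c * c * math.log(c * c) for c in coeffs if c != 0.0)


def best_basis(x, depth):
    """Return (cost, leaves) of the cheapest packet tree with at most `depth` levels.

    A node is split only if its two children together cost strictly less
    than the node itself (bottom-up best-basis search).
    """
    own = shannon_cost(x)
    if depth == 0 or len(x) < 2:
        return own, [x]
    low, high = haar_split(x)
    cost_low, leaves_low = best_basis(low, depth - 1)
    cost_high, leaves_high = best_basis(high, depth - 1)
    if cost_low + cost_high < own:
        return cost_low + cost_high, leaves_low + leaves_high
    return own, [x]


def threshold_to_zero(coeffs, t):
    """Set every coefficient whose magnitude is below t to exactly zero."""
    return [c if abs(c) >= t else 0.0 for c in coeffs]


def bits_per_sample(bit_rate, sample_rate=44100):
    """Average number of bits spent per audio sample (44.1 kHz is assumed)."""
    return bit_rate / sample_rate


if __name__ == "__main__":
    # Toy frame of 8 samples (length must be a power of two), unit energy.
    frame = [1.0 / math.sqrt(8.0)] * 8
    cost, leaves = best_basis(frame, depth=3)
    kept = [threshold_to_zero(leaf, 1e-6) for leaf in leaves]
    print("cost:", cost)
    print("subbands:", kept)
    print("bits/sample at 64 kb/s:", round(bits_per_sample(64000), 2))
```

In a real coder the coefficients that survive thresholding would then go to the psychoacoustically controlled quantizer and to Huffman and run-length coding; those stages are left out of this sketch.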


Similar resources

Tree and filter optimization for audio compression in a wavelet-based perceptual audio coder

This paper outlines a new perceptual low bit rate audio coding scheme based on adapted wavelet representations. It proposes adapting the wavelet tree and filters according to a perceptual entropy-based method. To achieve such an adaptive structure, a periodized wavelet packet transform is performed for each audio frame. After the transform, the encoder employs scalar adaptive quantization, controlled b...


High Quality Audio Compression Using an Adaptive Wavelet Packet Decomposition and Psychoacoustic

This paper presents a technique to incorporate psychoacoustic models into an adaptive wavelet packet scheme to achieve perceptually transparent compression of high quality (44.1 kHz) audio signals at about 45 kbit/s. The filter bank structure adapts according to psychoacoustic criteria and according to the computational complexity that is available at the decoder. This permits software imple...


Perceptual Coding of Audio Using Signal-Adaptive Filterbanks

This thesis studies the application of signal-adaptive filter banks in perceptual coding of audio with an emphasis on Wavelet Filter Banks (WFB). It provides an overview of perceptual coding of audio, the motivating psychoacoustic principles, transforms and wavelet theory. Additionally, different existing wavelet-based audio coding schemes are presented. The aim of most of the schemes is to overc...


Best wavelet-packet bases for audio coding using perceptual and rate-distortion criteria

This paper presents a new approach to the adaptation of a wavelet filterbank based on perceptual and rate-distortion criteria. The system makes use of a wavelet-packet transform where each subband can have an individual time-segmentation. Boundary effects can be avoided by using overlapping blocks of samples and therefore switching bases is possible at every tree-level without affecting other s...


Watermark Bit Rate in Diverse Signal Domains

This paper presents a study of the obtainable watermark data rate for information hiding algorithms. As the perceptual entropy of wideband monophonic audio signals is in the range of four to five bits per sample, a significant amount of additional information can be inserted into the signal without causing any perceptual distortion. Experimental results showed that transform domain watermar...




Publication date: 1988